IN THE CLAIMS 



1 (Currently Amended). A method comprising: 

detecting a color characteristic; 
detecting motion; and 

removing the flesh color from ^ captured video. 

2 (Original). The method of claim 1 including controlUng a processor-based system 
based on the detection of flesh color and the detection of a shape associated with a human being. 

3 (Original). The method of claim 2 including determining whether to process image 
data depending on whether both motion and flesh are detected. 

4 (Original). The method of claim 2 including capturing a frame of video at a time, and 
determining after capturing each frame whether or not flesh color has been detected. 

Claim 5 (Canceled). 

6 (Previously Presented), The method of claim 1 including moving an animation object 
while capturing video and removing the detected flesh color from the captured video. 

7 (Original). The method of claim 1 including capturing video of an animation object in 
a plurality of different positions and automatically removing an image of a user's hand from the 
captured video. 

8 (Previously Presented). An article comprising a medium storing instructions that, if 
executed, enable a processor-based system to: 

detect a color characteristic; 
detect motion; and 

remove the flesh color from captured video. 
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9 (Original). The article of claim 8 further storing instructions that enable the processor- 
based system to be controlled based on the detection of flesh color and the detection of a shape 
associated with a human being. 

10 (Original). The article of claim 9 further storing instructions that enable the 
processor-based system to determine whether to process image data depending on whether 
motion and flesh are detected. 

1 1 (Original). The article of claim 9 further storing instructions that enable the 
processor-based system to capture a fi-ame of video at a time and determine after capturing each 
frame whether flesh color has been detected. 

Claim 12 (Canceled). 

13 (Previously Presented). The article of claim 1 further storing instructions that enable 
the processor-based system to capture video of an animation object in a plurality of different 
positions and automatically remove an image of a user's hand from the captured video. 

14 (Currently Amended). A system comprising: 

a processor; and 

a storage coupled to said processor storing instructions that, if executed, enable 
the processor to detect motion and a skin color characteristic and remove the skin color 
characteristic from captured video. 

15 (Original), The system of claim 14 wherein said storage further stores instructions 
that enable the processor to detect a shape associated with a human being. 

16 (Original). The system of claim 14 further storing instructions that enable the 
processor to determine whether to process image data depending on whether motion and flesh 
color are detected. 
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17 (Original). The system of claim 14 including a digital imaging device coupled to said 
processor. 

18 (Previously Presented). A method comprising: 

capturing a video image of a speaker; 

receiving audio information from the speaker through at least one microphone; 
determining the user's position using said captured video image of said speaker; 

and 

based on the user's position, automatically adjusting a characteristic of the 

microphone. 

19 (Original). The method of claim 18 including receiving audio information from a pair 
of microphones and adjusting the sensitivity of the microphones based on the relative positioning 
of the user with respect to each microphone. 

20 (Original). The method of claim 18 including tracking the user's facial position in two 
dimensions and estimating the user's facial position in a third dimension. 

21 (Original). The method of claim 18 including tracking the user's facial position in 
three dimensions. 

22 (Original). The method of claim 18 including using a point of source filter to adjust the 
audio information received from the user and providing said adjusted audio information to a 
speech recognition engine. 

23 (Previously Presented). A system comprising: 

a video capture device to capture an image of a user; 

at least one microphone to capture speech from said user; 

a device to determine the user's position with respect to at least two microphones 
and to adjust the data from each microphone in response to the user's position relative to each 
microphone, using the image captured by said video captiire device. 
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24 (Original). The system of claim 23 including a pair of video cameras for capturing an 
image of said user. 

25 (Original). The system of claim 23 including a two dimensional face tracker that 
locates the user's face in two dimension. 

26 (Original). The system of claim 23 including a three dimensional face tracker that 
locates the user's face in three dimensions. 

27 (Original). The system of claim 23 including a point of source fdter to adjust the 
sensitivity of said microphones. 

Claims 28-30 (Canceled). 
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